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Abstract. Rule-based expert systems may be structurally and functionally mapped onto a 
special class of neural networks called expert networks. This mapping lends itself to adaptation 
of connectionist learning strategies for the expert networks. Following a process introduced by 
Kuncicky, Hruska, and Lacher, a parsing algorithm to translate CLIPS rules into a network of 
interconnected assertion and operation nodes has been developed. The translation of CLIPS 
rules to an expert network and back again is illustrated. Measures of uncertainty similar to those 
used in MYCIN-like systems are introduced into the CLIPS system and techniques for combining 
and firing nodes in the network based on rule-firing with these certainty factors in the expert 
system are presented. Several learning algorithms are under study which automate the process 
of attaching certainty factors to rules. 


EXPERT NETWORKS 

The idea of mapping an expert system onto a neural network in order to make the most 
of what each technology offers is a timely one [Fu and Fu 1990] [Gallant 1988] [Hall and 
Romaniuk 1990] [Kuncicky 1990] [Kuncicky et al. 1991] [Lacher et al. 1991a] [Towell 
et al.1990]. Hruska, Kuncicky, and Lacher define the special class of networks created 
from an expert system as expert networks. In these systems, rules of the form if A then 
B where A and B are assertions are mapped in an intuitive manner to nodes labelled 
A and B with a forward connection tying the two together. In expert systems which 
incorporate uncertainty, the certainty factor, a value between —1 and +1, is mapped to 
the connection strength in the network. 

The functioning of the network is also based on the expert system, with the combining 
and firing functions of the network nodes derived from the inference engine functions for 
the expert system. Inference engine functions which have counterparts in the network 
include those for combining evidence for rules that have consequents in common, methods 
of handling conjunction and negation in rules, thresholding functions, and those functions 
that govern firing of rules. The nodes axe not simple perceptron-type nodes, but rather 
axe more complex, based on how the expert system works internally. When activated 
with input, an expert network functions identically to the expert system inference engine 
operating on the rule base from which the network was derived. 

One of the most compelling reasons for pursuing the mapping of expert system con- 
cepts onto neural networks is the possibility for introducing connectionist learning tech- 
niques into traditional Al systems. Automation of even part of the expensive ana time- 
consuming knowledge acquisition process is a valuable contribution of this technology. 

1 Research partially supported by the Florida High Technology and Industry Council, the US Office of Naval 
Research, and Oak Ridge Associated Universities. 
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There are several levels of knowledge acquisition for an expert system [Hruska et al. 
1991a][Hruska et al. 1991b]. These include refining or adjusting the certainty factors 
for rules, forming new rules from existing concepts, and forming new concepts. Ideally, 
most of the knowledge acquisition process could be done directly from data. This training 
data consists of examples of input information and the associated desired (correct) output. 
Bypassing the knowledge elicitation process in which a hum am expert must express their 
expertise in the form of if-then rules is an ultimate goal. Automation would be useful at 
any of the levels listed above. 

The current research efforts reported in this paper focus on the area of automating 
the r efin ement of knowledge. Certainty factors represent perhaps the most subtle form 
of knowledge in a system, the confidence which the expert has in the consequent being 
related to the antecedent of the rule. In decision-making, the use of certainty factors 
enables the system to rank outcomes of the system based on the certainty factors attached 
to them. In CLIPS, we are in the process of implementing certainty factors via the declare 
option available in CLIPS rule formats. 


THE TRANSLATION PROCESS 


In order to apply the connectionist learning techniques under development, it is useful to 
translate the expert system rule base to expert network format. The process of translating 
an expert system rule base to the type of expert network described above was prototyped 
by Kuncicky and colleagues [Kuncicky 1990] [Kuncicky et al. 1991] for the expert system 
M.l [Ml 1986]. Following these ideas, an implementation of the translation process was 
written for a subset of CLIPS. Preliminary work on this project is summarized in [Johnson 
and Franke 1989]. 

The translation is performed in two steps. First, the rule base is simplified somewhat 
by replacing rules of designated complexity (such as those with disjunctions in their 
antecedents) with several simpler but collectively equivalent rules. For example, rules 
(with cf certainty factors) of the form 

if A or B then C (cf) 

are simplified to 


if A then C (cf) 


if B then C (cf). 

Another type of rule base simplification is reduction of the rule 

if A and (B or C) then D (cf) 


to 

if A and B then D (cf) 
if A and C then D (cf) 

which distributes conjunction over disjunction. Rules of the form 

if not(A or B) then C (cf) 
are simplified via DeMorgan’s law to 

if not A and not B then C (cf). 

Simplification of conjunction in the consequent converts rules of the form 

if A then B and C (cf) 
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to the equivalent set of rules 


if A then B (cf) 
if A then C (cf). 

Through this process, disjunctions in the antecedents and conjunctions in the conse- 
quents are eliminated. Disjunctions in the consequents axe not allowed. 

After the rule base simplification transformations are complete, the rule base is trans- 
lated into an expert network. At this point, there axe three basic forms of rules: regular 
rules, rules witn conjunctions in the antecedent, and rules with negations in the an- 
tecedent. These three types of rules may be compounded to form rules of arbitrary 
complexity in the antecedent, a regular rule, of the form 

if A then B (cf) 

is converted to a portion of an expert network as 



Rules with conjunctions and negations involve creation of special operator type nodes. 
Connections from assertions to these operator nodes are fixed with a weight of 1.0. 

Typically, a rule involving a conjunction such as 

if A and B then C (cf) 

is converted to 



while rules with negations of the form 

if not A then B (cf) 

become 



A short example of the translations process in CLIPS is shown in Tables 1 and 2 and 
Figure 1. 

As of this date, the translation program handles input rule bases which contain 
defrules, deffacts, and asserts. Plans axe now underway to extend the flexibility of this 
prototype to accept variables, pattern matching, and retraction. 
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(defrule clear 

(declare (certainty 0.96)) 

; If the barometer is high and there are no clouds then it is clear, 
(and (high) 

(not (cloudy) ) ) 

-> 

(printout t "Clear” crlf) 

) 

(defrule sky-water 
(declare (certainty 0.7)) 

; if the barometer is low or it is cloudy then it is rainy. 

(or (not (high) ) 

(cloudy) ) 

-> 

(assert (precipitation) ) 

) 

(defrule snowy 
(declare (certainty 0.8)) 

; If there is precipitation but it is not hot then time for snow, 
(and (precipitation) 

(not (hot))) 

-> 

(printout t "Snow!" crlf) 

) 

(defrule rainy 
(declare (certainty 0.9)) 

; If there is precipitation and it is hot then it mu3t be rainy. 

(and (precipitation) 

(hot) ) 

-> 

(printout t "Get your umbrellas." crlf) 

) 

Table 1 . Original Rule Base 


(defrule clear-1 
(declare (certainty 0.96)) 

; From rule number 1 
(high) 

(not (cloudy) ) 

-> 

^ (printout t "Clear" crlf) 

(defrule sky-water-2 
(declare (certainty 0.7)) 

; From rule number 2 
(not (high)) 

-> 

(assert (precipitation) ) 

(defrule sky-water-3 
(declare (certainty 0.7)) 

; From rule number 2 
(cloudy) 

-> 

^ (assert (precipitation) ) 

(defrule snowy-4 
(declare (certainty 0.8)) 

; From rule number 3 
(precipitation) 

(not (hot)) 

-> 

(printout t "Snow!" crlf) 

(defrule rainy-5 
(declare (certainty 0.9)) 

; From rule number 4 
(precipitation) 

(hot) 

•> 

(printout t "Get your umbrellas." crlf) 


Table 2. After Rule Base Transformations 
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Figure 1. Expert Network From Rule Base 


PROCESSING CERTAINTY FACTORS IN CLIPS 

The study of M.l and its maimer of handling certainty factors has led us to contemplation 
of improvements which could be made in porting this process to CLIPS. Our introduction 
of certainty factors into CLIPS rule bases lea to the necessity of altering the CLIPS 
source code to implement the processing of certainty factor information. This includes a 
set of algorithms for evidence accumulation, combining evidence, outputting a certainty 
factor value for the rule, and firing the rule based on that certainty factor output. One 
possibility, currently under consideration by Lacher and Traphan [Traphan 1991] is to 
build and improve on the inference engine for the expert system shell EMYCIN (upon 
which M.l is also based). EMYCIN uses a threshold of 0.2 to determine whether a 
rule will fire or not. The proposal is to smooth this thresholding function, resulting in a 
system which yields analog values very close to those of the original EMYCIN system, but 
which is continuously differentiable. This feature will enable gradient descent learning 
techniques to be applied in a more consistent manner over the entire range of values. 
This work is currently in progress. 

Learning techniques developed for expert networks include Goal-Directed Monte 
Carlo Search [Hruska et al. 1991b] and Expert System Backpropagation [Lacher et al. 
1991b]. The first of these is designed using the principles of reinforcement learning with 
increasing levels of noise applied to the connections between nodes. The second is an 
adaptation of the standard backpropagation of error learning algorithm which uses gra- 
dient descent to find the weight settings (certainty factors) which minimize the difference 
between the system’s output and the ideal (correct) output. The major innovation of Ex- 
pert System Backpropagation is the use of the expert system’s inference engine functions 
in the complex nodes of an expert network to perform the computations necessary for 
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gradient descent. Both of these learning algorithms have been tested on the M.l system 
and axe currently being upgraded to work with the altered version of CLIPS described 
above. 


CONCLUSION 

A system for translating a subset of CLIPS expert system rule bases into expert networks 
has been prototyped and is under ongoing development. CLIPS rules will have certainty 
factors attached to them which may be used to express uncertainty in the inferencing 
process. A proposal for smoothing the evidenciary combining and firing functions of 
traditional EMYCIN-like systems is described. Learning algorithms for expert networks 
are briefly described. As work progresses on this system, we draw nearer to realization 
of a tool for automating the knowledge acquisition process involved in traditional AI 
systems. 
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